A Comparative Investigation of the Combined Effects of Pre-Processing, Wavelength Selection, and Regression Methods on Near-Infrared Calibration Model Performance.

Authors

  • Jian Wan
  • Yi-Chieh Chen
  • A Julian Morris
  • Suresh N Thennadil
Abstract

Near-infrared (NIR) spectroscopy is widely used in fields ranging from pharmaceutics to the food industry for analyzing the chemical and physical properties of substances. Its advantages over other analytical techniques include the physical interpretability of spectral data, nondestructive and fast measurement, and little or no need for sample preparation. The successful application of NIR spectroscopy relies on three main aspects: pre-processing of spectral data to eliminate nonlinear variations due to temperature, light scattering, and other effects; selection of the wavelengths that contribute useful information; and identification of suitable calibration models using linear or nonlinear regression. Several methods have been developed for each of these three aspects, and many comparative studies exist for an individual aspect or for some combinations. However, comparative studies of the interactions among the three aspects are still lacking; such studies can shed light on the role each aspect plays in calibration and on how to combine methods from each aspect to obtain the best calibration model. This paper aims to provide such a comparative study based on four benchmark data sets, using three typical pre-processing methods, namely orthogonal signal correction (OSC), extended multiplicative signal correction (EMSC), and optical path-length estimation and correction (OPLEC); two existing wavelength selection methods, namely stepwise forward selection (SFS) and genetic algorithm optimization combined with partial least squares regression for spectral data (GAPLSSP); and four popular regression methods, namely partial least squares (PLS), least absolute shrinkage and selection operator (LASSO), least squares support vector machine (LS-SVM), and Gaussian process regression (GPR).
The comparative study indicates that, in general, pre-processing of spectral data can play a significant role in calibration while wavelength selection plays a marginal role, and that the combination of certain pre-processing, wavelength selection, and nonlinear regression methods can achieve superior performance over traditional linear regression-based calibration.
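
The three-stage structure described in the abstract (pre-processing, then wavelength selection, then regression) can be illustrated with a minimal sketch. This is not the authors' implementation: standard normal variate (SNV) stands in for OSC/EMSC/OPLEC, ordinary least squares stands in for PLS, LASSO, LS-SVM, and GPR, and the spectra are synthetic with a made-up multiplicative scatter factor. All function names and data here are assumptions for illustration only.

```python
import numpy as np

def snv(X):
    """Standard normal variate: centre and scale each spectrum (row)."""
    return (X - X.mean(axis=1, keepdims=True)) / X.std(axis=1, keepdims=True)

def fit_ols(X, y):
    """Least-squares fit with an intercept; returns [intercept, slopes...]."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef

def predict_ols(coef, X):
    return coef[0] + X @ coef[1:]

def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def forward_select(X_tr, y_tr, X_val, y_val, n_select):
    """Greedy stepwise forward selection scored by validation RMSE."""
    selected, remaining = [], list(range(X_tr.shape[1]))
    for _ in range(n_select):
        scores = []
        for j in remaining:
            cols = selected + [j]
            coef = fit_ols(X_tr[:, cols], y_tr)
            scores.append(rmse(y_val, predict_ols(coef, X_val[:, cols])))
        best = remaining[int(np.argmin(scores))]
        selected.append(best)
        remaining.remove(best)
    return selected

# Synthetic spectra (hypothetical): one analyte band on a sloped baseline,
# multiplied by a random per-sample scatter factor, plus small noise.
rng = np.random.default_rng(0)
n, p = 120, 40
wl = np.linspace(0.0, 1.0, p)
peak = np.exp(-((wl - 0.5) ** 2) / 0.005)
baseline = 1.0 + wl
y = rng.uniform(0.0, 1.0, n)
scatter = rng.uniform(0.7, 1.3, n)
X = scatter[:, None] * (baseline + y[:, None] * peak) + 0.01 * rng.normal(size=(n, p))

# Train / validation / test split by position (samples are already random).
tr, va, te = slice(0, 60), slice(60, 90), slice(90, 120)

results = {}
for name, Xp in {"raw": X, "snv": snv(X)}.items():
    cols = forward_select(Xp[tr], y[tr], Xp[va], y[va], n_select=3)
    coef = fit_ols(Xp[tr][:, cols], y[tr])
    results[name] = (cols, rmse(y[te], predict_ols(coef, Xp[te][:, cols])))
    print(name, "wavelength indices:", cols, "test RMSE:", round(results[name][1], 4))
```

Running the script prints the selected wavelength indices and the held-out RMSE for raw and SNV-corrected spectra, which is the kind of side-by-side comparison the study performs across its much larger set of method combinations.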

Related resources

A Comparative Study Concerning Linear and Nonlinear Models to Determine Sugar Content in Sugar Beet by Near Infrared Spectroscopy (NIR)

This paper reports on the use of Artificial Neural Networks (ANN) and Partial Least Squares regression (PLS) combined with NIR spectroscopy (900-1700 nm) to design calibration models for the determination of sugar content in sugar beet. In this study a total of 80 samples were used as the calibration set, whereas 40 samples were used for prediction. Three pre-processing methods, including Multiplic...

Development of near infrared reflectance spectroscopy (NIRS) calibration model for estimation of oil content in a worldwide safflower germplasm collection

The key objective of this research project was the development of an NIRS calibration model as a rapid, precise, robust, and cost-effective method to estimate oil content in ground seeds of a worldwide safflower germplasm collection grown under different agro-climatic conditions. The oil content was measured by the accelerated solvent extraction method in a total of 328 samples collected across 2004 (16...

Quantitative Comparison of Analytical solution and Finite Element Method for investigation of Near-Infrared Light Propagation in Brain Tissue Model

Introduction: Functional Near-Infrared Spectroscopy (fNIRS) is an imaging method in which light source and detector are installed on the head; consequently, re-emission of light from human skin contains information about cerebral hemodynamic alteration. The spatial probability distribution profile of photons penetrating tissue at a source spot, scattering into the tissue, and being released at ...

Multivariate calibration of near infrared spectroscopy in the presence of light scattering effect: a comparative study

When analyzing heterogeneous samples using spectroscopy, the light scattering effect introduces non-linearity into the measurements and deteriorates the prediction accuracy of conventional linear models. This paper compares the prediction performance of two categories of chemometric methods: pre-processing techniques to remove the non-linearity, and non-linear calibration techniques to directly...

Uninformative Biological Variability Elimination in Apple Soluble Solids Content Inspection by Using Fourier Transform Near-Infrared Spectroscopy Combined with Multivariate Analysis and Wavelength Selection Algorithm

Methods for eliminating uninformative biological variability were studied in the near-infrared calibration model for predicting the soluble solids content of apples. Four different preprocessing methods, namely, Savitzky-Golay smoothing, multiplicative scatter correction, standard normal variate, and mean normalization, as well as their combinations, were applied to raw Fourier transform near-infr...

Journal:
  • Applied Spectroscopy

Volume 71, Issue 7

Pages: -

Publication date: 2017